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Abstract 

We consider Ising models on graphs that converge locaUy to trees. Examples include random regular 
graphs with bounded degree and uniformly random graphs with bounded average degree. We prove that 
the 'cavity' prediction for the limiting free energy per spin is correct for any temperature and external 
field. Further, local marginals can be approximated by iterating a set of mean field (cavity) equations. 
Both results are achieved by proving the local convergence of the Boltzmann distribution on the original 
graph to the Boltzmann distribution on the appropriate infinite random tree. 

1 Introduction 

An Ising model on the finite graph G (with vertex set V , and edge set E) is defined by the following 
Boltzmann distributions over x = {xi : i E T^}, with Xj E {+1,-1} 

A^fe) = y(l „x exp i[i ^ XiXj + B^Xi\. (1.1) 

These distributions are parametrized by the 'magnetic field' B and 'inverse temperature' /? > 0, where the 
partition function Z{f3,B) is fixed by the normalization condition '^^^^{x) = 1. Throughout the paper, 
we will be interested in sequences of graphs Gn = iVn = En) of diverging size n. 

Non-rigorous statistical mechanics techniques, such as the 'replica' and 'cavity methods,' allow to make 
a number of predictions on the model (jl.l|) . when the graph G 'lacks any finite-dimensional structure.' 
The most basic quantity in this context is the asymptotic free entropy density 

(l)iP,B) = lim - log ZniP,B) (1.2) 

n— >oo n 

(this quantity is sometimes called in the literature also free energy or pressure). The limit free entropy 
density and the large deviation properties of Boltzmann distribution were characterized in great detail [1] 
in the case of a complete graph Gn = Kn (the inverse temperature must then be scaled by 1/n to get a 
non-trivial limit). Statistical physics predictions exist however for a much wider class of graphs, including 
most notably sparse random graphs with bounded average degree, see for instance O [3l H]. This is a 
direction of interest for at least two reasons: 



*Departments of Statistics and Mathematics, Stanford University 
^Departments of Electrical Engineering and Statistics, Stanford University 
Research partially supported by NSF grant #DMS-0806211. 

AMS (2000) Subject Classification: Primary: 82B44 Secondary; 82B23, 60F10, 60K35, 05C80, 05C05 

Keywords: Ising model, random sparse graphs, cavity method, Bethe measures, belief propagation, local weak convergence. 



1 



(i) Sparse graphical structures arise in a number of problems from combinatorics and theoretical computer 
science. Examples include random satisfiability, coloring of random graphs, graph partitioning [5j|. In all 
of these cases, the uniform measure over solutions can be regarded as the Boltzmann distribution for a 
modified spin glass with multi-spin interactions. Such problems have been successfully attacked using non 
rigorous statistical mechanics techniques. 

A mathematical foundation of this approach is still lacking, and would be extremely useful. 

(ii) Sparse graphs allow to introduce a non-trivial notion of distance between vertices, namely the length 
of the shortest path connecting them. This geometrical structure allows for new characterizations of the 
measure p.ip in terms of correlation decay. This type of characterization is in turn related to the theory 
of Gibbs measures on infinite trees [6]. 

The asymptotic free entropy density (II. 2p was determined rigorously only in a few cases for sparse 
graphs. In [7j, this task was accomplished for random regular graphs. De Santis and Guerra developed 
interpolation techniques for random graphs with independent edges (Erdos-Renyi type) but only deter- 
mined the free entropy density at high temperature and at zero temperature (in both cases with vanishing 
magnetic field). The latter is in fact equivalent to counting the number of connected components of a 
random graph. 

In this paper we generalize the previous results by considering generic graph sequences that converge 
locally to trees. Indeed, we control the free entropy density by proving that the Boltzmann measure (jl.ip 
converges locally to the Boltzmann measure of a model on a tree. The philosophy is related to the local 
weak convergence method of [9J. 

Finally, several of the proofs have an algorithmic interpretation, providing an efficient procedure for 
approximating the local marginals of the Boltzmann measure. The essence of this procedure consists 
in solving by iteration certain mean field (cavity) equations. Such an algorithm is known in artificial 
intelligence and computer science under the name of belief propagation. Despite its success and wide 
applicability, only weak performance guarantees have been proved so far. Typically, it is possible to prove 
its correctness in the high temperature regime, as a consequence of a uniform decay of correlations holding 
there (spatial mixing) [10\ lllj. The behavior of iterative inference algorithms on Ising models was recently 
considered in [I2l [13] . 

The emphasis of the present paper is on the low-temperature regime in which uniform decorrelation 
does not hold. We are able to prove that belief propagation converges exponentially fast on any graph, and 
that the resulting estimates are asymptotically exact for large locally tree-like graphs. The main idea is to 
introduce a magnetic field to break explicitly the +/— symmetry, and to carefully exploit the monotonicity 
properties of the model. 

A key step consists in estimating the correlation between the root spin of an Ising model on a tree, 
and positive boudary conditions. Ising models on trees are interesting per se, and have been the object of 
significant mathematical work, see for instance \14\ [T5 l [T6]. The question considered here appears however 
to be novel. 

The next section provides the basic technical definitions (in particular concerning graphs and local 
convergence to trees), and the formal statement of our main results. Notations and certain key tools are 
described in Section [3] with Section [J] devoted to proofs of the relevant properties of Ising models on trees 
(which are of independent interest). The latter are used in Sections [5] and [6] to derive our main results 
concerning models on tree- like graphs. A companion paper [IT] deals with the related challenging problem 
of spin glass models on sparse graphs. 
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2 Definitions and main results 



The next subsections contain some basic definitions on graph sequences and the notion of local convergence 
to random trees. Sections 12.21 and 12.31 present our results on the free entropy density and the algorithmic 
implications of our analysis. 

2.1 Locally tree-like graphs 

Let P = {Pfc : k > 0} a probability distribution over the non-negative integers, with finite, positive first 
moment, and denote by 

2^1=1 

its size-biased version. For any t > 0, we let T(P, p, t) denote the random rooted tree generated as follows. 
First draw an integer k with distribution P^, and connect the root to k offspring. Then recursively, for 
each node in the last generation, generate an integer k independently with distribution p^, and connect 
the node to fc — 1 new nodes. This is repeated until the tree has t generations. 

Sometimes it will be useful to consider the ensemble T(p, t) whereby the root node has degree k — 1 with 
probability pk- We will drop the degree distribution arguments from T{P,p,t) or T{p,t) and write T(t) 
whenever clear from the context. Notice that the infinite trees T(P, p, oo) and T{p, oo) are well defined. 

Probability with respect to the random tree ensemble is denoted by Pp{ • }. The average branching 
factor of trees will be denoted by p, and the average root degree by P. In formulae 



P = J2kPk, ■p = Y,{k-l)pk. (2.2) 



fc=0 k=l 

We denote by G„ = {Vn-,En) a graph with vertex set Vn = [n] = {!,..., n}. The distance d{i,j) 
between i,j G Vn is the length of the shortest path from i to j in G„. Given a vertex i € Vn, we let 
Bj(t) be the set of vertices whose distance from i is at most t. With a slight abuse of notation, Bj(t) will 
also denote the subgraph induced by those vertices. For i € Vn, we let di denote the set of its neighbors 
di = {j G Vn ■ G En], and \di\ its size (i.e. the degree of i). 

This paper is concerned by sequence of graphs {Gn}n&M of diverging size, that converge locally to trees. 
Consider two trees Ti and T2 with vertices labeled arbitrarily. We shall write Ti ~ T2 if the two trees 
become identical when vertices are relabeled from 1 to |ri| = IT2I, in a breadth first fashion, and following 
lexicographic order among siblings. 

Definition 2.1. Considering a sequence of graphs {Gn}n&M, let P„ denote the law induced on the ball 
Bj(t) in Gn centered at a uniformly chosen random vertex i S [n]. We say that converges locally to 

the random tree T(P, p, 00) if, for any t, and any rooted tree T with t generations 

lim P„{Bi(t) ~ T} = Pp{T(t) ~ T} . (2.3) 

n— >oo 

We say that {Gn} is uniformly sparse if there exists a sequence e{l) | 0, such that for any n 

J2 \di\Ii\di\ > I) < ne{l) . (2.4) 
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2.2 Free entropy 

According to the statistical physics derivation, the model (jl.ip has a line of first order phase transitions 
for B = and (3 > fic (that is, where the continuous function B ^ <P{13, B) exhibits a discontinuous 
derivative). The critical temperature depends on the graph only through the average branching factor and 
is determined by the condition 

p (tanh /?c) = 1 . (2.5) 

Notice that /3c — l//? for large degrees. 

The asymptotic free-entropy density is given in terms of the fixed point of a distributional recursion. 
One characterization of this fixed point is as follows. 

Lemma 2.2. Consider the sequence of random variables {/i^*)} defined by h^'^^ = identically and, for 
t > 0, 

h(^+^)^B+J^aP,h?), (2.6) 

i=l 

where K is an integer valued random variable of distribution p, 

h) = atanh[tanh(/?) tanh(/i)] , (2.7) 

and the hf^ 's are i.i.d. copies of h^^^ that are independent of K. If B > and p has finite first moment, 
then the distributions of /i^*^ are stochastically monotone and /i^*) converges in distribution to the unique 
fixed point h* of the recursion h2. 6|) that is supported on [0, oo). 

Our next result confirms the statistical physics prediction for the free entropy density. 

Theorem 2.3. Let {GnjnGW be a sequence of uniformly sparse graphs that converges locally to T(P, p, oo). 
// p has finite first moment (and hence P has finite second moment), then for any i? E R and /? > 0, 

lim ilogZ„(/3,5) =0(/3,5), (2.8) 

n— >oo n 

where taking L of distribution Pi independently of the 'cavity fields' hi that are i.i.d. copies of the fixed 
point h* of Lemma [2^ (/)(/?, B) = (j){(3, —B) is given for B > by 

P P 

(p{/3, B) = — log cosh(/5) - ^ E log[l + tanh(/3) tanh(/ii) tanh(/i2)] 



n L 

+ Elog|e^JJ[l + tanh(/3)tanh(/ii)]+e~^JJ[l-tanh(^)tanh(/ii)]|, (2. 

i=l i=l 



and (j){l3,0) is the limit of (j){(3,B) as B ^ 0. 

The proof of Theorem 12.31 is based on two steps 

(a) Reduce the computation of (j)n{f3,B) = ^ log Z„(/3, i?) to computing expectations of local (in G„) 
quantities with respect to the Boltzmann measure This is achieved by noticing that the 
derivative of (/'n(/3, B) with respect to /3 is a sum of such expectations. 

(b) Show that expectations of local quantities on Gn are well approximated by the same expectations 
with respect to an Ising model on the associated tree T(P, p, t) (for t and n large.) This is proved by 
showing that, on such a tree, local expectations are insensitive to boundary conditions that dominate 
stochastically free boundaries. The thesis then follows by monotonicity arguments. 
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The key step is of course the last one. A stronger requirement would be that these expectation values 
are insensitive to any boundary condition, which would coincide with uniqueness of the Gibbs measure on 
T(P, p, oo). Such a requirement would allow for an elementary proof, but holds only at 'high' temperature, 

Indeed, insensitivity to positive boundary conditions is proved in Section [J] for a large family of 'con- 
ditionally independent' trees. Beyond the random tree T{P, p,oo), these include deterministic trees with 
bounded degrees and multi-type branching processes. This result allows to generalize Theorem 12. 31 to other 
graph sequences, that converge locally to random trees different from T(P, p, oo). A simple example would 
be the one of uniformly random bipartite graphs with two degree distributions Pk and for the two types 
of vertices. We refrain from formalizing such generalizations in order not to burden our presentation. 



2.3 Algorithmic implications 

The free entropy density is not the only quantity that can be characterized for Ising models on locally 
tree-like graphs. Indeed local marginals can be efficiently computed with good accuracy. The basic idea 
is to solve a set of mean field equations iteratively. These are known as Bethe-Peierls or cavity equations 
and the corresponding algorithm is referred to as 'belief propagation' (BP). 

More precisely, associate to each directed edge in the graph i j, with £ G, a distribution 

Vi^j{xi) over Xi G 1}. In the computer science literature these distributions are referred to as 

'messages'. They are updated as follows 

4^l\x.) = ^ e''^^ n (2.10) 

^i^j ledi\j 

The initial conditions T^f^ji ■ ) may be taken to be uniform or chosen according to some heuristic. We will 

say that the initial condition is positive if fj-!!^j(+l) > I'^ji—^) for each of these messages. 

Our next result concerns the uniform exponential convergence of the BP iteration to the same fixed 
point of (j2.10p . irrespective of its positive initial condition. 

Theorem 2.4. Assume P > 0, B > and G is a graph of finite maximal degree A. Then, there exists 
A = A{I3,B,A) finite, A = X{(3,B,A) > and a fixed point {i^i^j} of the BP iteration 112.10]) such that 

for any positive initial condition and all t > 0, 

sup -z^*_Jtv< Aexp(-At). (2.11) 

For i^: € V let U = Bj, (r) be the ball of radius r around in G, denoting by Ejj its edge set, by dU 
its border (i.e. the set of its vertices at distance r from i^,), and for each i S dU let j{i) denote any one 
fixed neighbor of i in [/. 

Our next result shows that the probability distribution 

J^uixu) = ^explp Y XiXj + B Y Xi\ W v*_^^^){x.i) , (2.12) 

^ [ ieU\dU J i&dU 

with the fixed point of the BP iteration per Theorem 12.41 is a good approximation for the 

marginal fJ-ui') of variables Xjj = {xi : i G U} under the Ising model (jl.ip . 

Theorem 2.5. Assume P > 0, B > and G is a graph of finite maximal degree A. Then, there exist 
finite c = c{P,B,A) and A = X{P,B,A) > such that for any £ G and U = Bj^(r), if Bj^(t) is a tree 
then 

||/^[/-z^c/||Tv<exp{c"+i-A(t-r)} . (2.13) 
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2.4 Examples 



Many common random graph ensembles [18] naturally fit our framework. 

Random regular graphs. Let G„ be a uniformly random graph with degree k. As n — > oo, the sequence 
{Gn} is obviously uniformly sparse, and converges locally almost surely to the random rooted Cayley tree of 
degree k. Therefore, in this case Theorem 12.31 applies with = 1 and = for i ^ k. The distributional 
recursion ()2.6p then evolves with a deterministic sequence recovering the result of f7]. 

Erdds-Renyi graphs. Let Gn be a uniformly random graph with m = n'j edges over n vertices. The 
sequence {Gn} converges locally almost surely to a Galton- Watson tree with Poisson offspring distribution 
of mean 27. This corresponds to taking = {2'y)^e~'^'^ /k\. The same happens to classical variants of this 
ensemble. For instance, one can add an edge independently for each pair with probability 27/n, or 
consider a multi-graph with Poisson(27/n) edges between each pair 

Li all these cases {Gn} is almost surely uniformly sparse. In particular Theorem 12.31 extends the results 
of [8j to arbitrary non-zero temperature and magnetic field. 

Arbitrary degree distribution. Let P be a distribution with finite second moment and G„ a uniformly 
random graph with degree distribution P (the number of vertices of degree k is obtained by rounding nPk). 
Then {Gn} is almost surely uniformly sparse and converges locally to T(P, p, 00). The same happens if Gn 
is drawn according to the so-called configuration model (c.f. [19]). 



3 Preliminaries 

We review here the notations and a couple of classical tools we use throughout this paper. To this end, 
when proving our results it is useful to allow for vertex-dependent magnetic fields -Bj, that is, to replace 
the basic model (II. ip by 

Given U <Z V, we denote by {+)u (respectively {—)u) the vector {xi = +1, i G U} (respectively, 
{xi = —1, i € U}), dropping the subscript U whenever clear from the context. Further, we use Xjj ^ 
when two real valued vectors x and x' are such that Xi < x\ for alH E f/ and say that a distribution pui') 
over is dominated by a distribution />[/( • ) over this set (denoted pu ^ p'jj), if the two distributions can 
be coupled so that Xjj ^ x/jj for any pair {xu,x'u) drawn from this coupling. Finally, we use throughout 
the shorthand {u, f) = f{x)v{x) for a distribution v and function / on the same finite set, or (/) when 
v is clear from the context. 

The first classical result we need is Griffiths inequality (see Theorem IV. 1.21]). 

Theorem 3.1. Consider two Ising models /u( • ) and p'{ ■ ) on graphs G = {V, E) and G' = {V, E'), inverse 
temperatures j3 and 13' , and magnetic fields {Bi} and {B'^, respectively. If E C E' , (3 < (3' andO < Bi < B'- 
for all i € V, then < (^jHiGt/ - (/^Mljec/^*) M any U CV. 

The second classical result we use is the GHS inequality (see |21j ) about the effect of the magnetic field 
S on the local magnetizations at various vertices. 

Theorem 3.2 (Griffiths, Hurst, Sherman). Let /? > and for B_ = {Bi : i G V}, denote by mj{B_) = 
p{{x : Xj = +1}) — p{{x : Xj = —1}) the local magnetization at vertex j in the Ising model 113. If Bi > 
for all i G y, then for any three vertices j, k,l (^V (not necessarily distinct), 

d'^rujiB) 



dBM 



<0. (3.2) 
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Finally, we need the following elementary inequality 

Lemma 3.3. For any function f : X [0, /max] CLiT'd distributions v, v' on the finite set X such that 
v{f > 0) > and v'{f > 0) > 0, 



v{x)f{x) v'{x)f{x) 



n — /\ / / f\\ \\^-^ TV 3.3) 

max((zy,/), {v',f)) 



In particular, if < /min < /C^;), then the right hand side is bounded by (3/max//min)i|i^ — z^'Htv- 

Proof. Assuming without loss of generality that {v' , f) > {v,f) > 0, the left hand side of (jS.Sp can be 
bounded as 



L^^\^(x)fix){i.',f)-i.'ix)fix){i.J)\ < 



^ /max II /II , 2/iiiax n /n 

This implies the thesis. □ 



4 Ising models on trees 



We prove in this section certain facts about Ising models on trees which are of independent interest and as a 
byproduct we deduce Lemma [2.2l and the theorems of Section [2.31 In doing so, let T denote a conditionally 
independent infinite tree rooted at the vertex 0. That is, for each integer A; > 0, conditional on the subtree 
T{h) of the first k generations of T, the number of offspring Aj for j E dT{k) are independent of each 
other, where dT{k) denotes the set of vertices at generation k. We further assume that the (conditional on 
T(/c)) first moments of Aj are uniformly bounded by a given non-random finite constant A. In addition to 
T = T(P, /9, oo) this flexible framework accommodates for example random bipartite trees, deterministic 
trees of bounded degree and percolation clusters on them. 

For each i >! the Ising models on T(^) with free and plus boundary conditions are then 



1 



exp 



13 ^ XiX 

(ii)GTW 



(4.1) 



/'+(x) 



exp 



(3 ^ ^ Xi Xj 



IfeT(^) = i+)dT(e)) 



(4.2) 



Equivalently /'O is the Ising model (13. ip on T(^) with magnetic fields {Bi} and fi^'^ is the modified Ising 
model corresponding to the limit Bi ] +oo for all i G dT{£). To simplify our notations we denote such 
limits hereafter simply by setting Bi = +oo and use /u^ for statements that apply to both free and plus 
boundary conditions. 

We start with the following simple, but useful observation. 

Lemma 4.1. For a subtree U of a finite tree T let d^U denote the subset of vertices ofU connected by an 
edge to W = T \ U and for each u ^ d^U let {xu)w denote the root magnetization of the Ising model on 
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the maximal subtree of W U {u} rooted at u. The marginal on U of the Ising measure on T, denoted 
njj is then an Ising measure on U with magnetic field B'^ = atanh.{{xu)w) ^ Bu for u e 9*C/ and B'^ = Bu 
for u ^ 9*C/. 

Proof. Since C/ is a subtree of the tree T, the subtrees for u S (9*C/ are disjoint. Therefore, with fiu{x) 
denoting the Ising model distribution for we have that 

'-ifinoj) n '^«(^«)' (4.3) 



for the Boltzmann weight 



= exp|/3 ^ XuXv+ ^ BuX^ . 

{uv)£U u£U\d,U 

Further, x^ £ {+1, —1} so for each u e d^^U and some constants c^, 

1 

fi-u{xu) = -{I + Xu{xu)w) = c„exp(atanh((xn)vi/)x„) . 

Embedding the normahzation constants Cu within Z we thus conclude that /i^ is an Ising measure on U 
with the stated magnetic field B'^. Finally, comparing the root magnetization for with that for {u} we 
have by Griffiths inequality that {xu)w ^ tanh(i?u), as claimed. □ 



Theorem 4.2. Suppose T is a conditionally independent infinite tree of average offspring numbers bounded 
by A. For < Bmin < -Bmax, /3max and A finite, there exist M = M(/3max, ^min, A) and C = C(/3max, ^max) 
finite such that if Bi < i?max for all i G T(r — 1) and Bi > Bmm for all i E T(^), i > r, then 

E - M&°IItv < 5ii - r) E{ClTMI} , (4.4) 

for 5{t) = M/t, all U C T(r) and /3 < /3max. 

Proof. Fixing ^ > r it suffices to consider U = T(r) (for which the left side of (j4.4p is maximal). For this U 
and T = T{£) we have that 5*f7 = dT{r) and U\d^U = T(r — 1), where in this case the Boltzmann weight 
/(•) in dO]) is bounded above by /max = d'''^'')! and below by fmin = l//max for c = exp(/3max + -Bmax)- 
Further, the plus and free boundary conditions then differ in (14. 3p by having the corresponding boundary 

conditions at generation ^ — r of each subtree T^, which we distinguish by using fit^^{xu) instead of fiu{xu)- 
Since the total variation distance between two product measures is at most the sum of the distance between 
their marginals, upon applying Lemma 13.31 we deduce from (14. 3p that 

II^'tm-^'tmIItv<^c2|"'('-^i E l/i.+(x. = i)-/i°(x. = i)|. 

je9T(r) 

By our assumptions, conditional on f7 = T(r), the subtrees Tj of T = T(^) denoted hereafter also by Tj are 
for i £ dT{r) independent of each other. Further, 2fL^^^{xi = 1) — 1 is precisely the magnetization of their 
root vertex under plus/free boundary conditions at generation £ — r. Thus, taking C = ec? (and using the 
inequality y < e^), it suffices to show that the magnetizations m^'^^'^{B_) = {fj,^'^^^ , X0) at the root of any 
such conditionally independent infinite tree T satisfy 

E{m^'+(S) - m^'0(5)} < ^ , (4.5) 
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for some M = M(/3max> -Bmin, A) finite, all f3 < /?max and £ > 1, where we have removed the absolute value 
since mf''~^{B) > m^'^{B_) by Griffiths inequality. 

Note that (j4.5p trivially holds for /3 = (in which case ^^'^{x0) = ^^'^{x0)). Assuming hereafter that 
/3 > we proceed to prove ()4.5p when each vertex of T(£ — 1) has a non-zero offspring number. To this 
end, for ^ = {//^ G E : i G dJ{k)] let 

l/'—{x) = ^^exp|/3 ^ XiXj+ ^ -B,;Xi+ ^ -f^x^j 

fe')eT(fe) jeT(fc) ieeT(fc) 

and denote by m''{B_,H_) the corresponding root magnetization. Writing H instead of H_ for constant 
magnetic field on the leave nodes, that is, when Hi = H for each i G dT{k), we note that m^'~^{B) = 
m^{B_,oo) and m^'^{B) = m^{B_,0). Further, applying Lemma l4.ll for the subtree T(A: — 1) of T(A;) we 
represent m^{B_, oo) as the root magnetization m^~^{B^, 0) on T(A;— 1) where B[ = Bi-\-f3Ai for i G dT{k—l) 
and B'^ = Bi for all other i. Consequently, 

m\B, oo) = m'^-^B, {/3AJ) . (4.6) 

Recall that if g^j- < for i = then applying Jensen's inequality one variable at a time we 

have that Wi g(Zi, . . . , Zs) < g(E Zi, . . . ,'E Zg) for any independent random variables Zi, . . . , Zg- By the 
GHS inequality, this is the case for H_ i— s- 'm^~^ {B_, H_) , hence with E^, denoting the conditional on T(A;) 
expectation over the independent offspring numbers Aj for i G dT{k), we deduce that 

Efc_i m\B, oo) < m''-\B, {/3Efc_iAJ) < m''-\B, /3A) , (4.7) 

where the last inequality is a consequence of Griffiths inequality and our assumption that E^Aj < A for 
any i G dT{t) and all t > 0. Since each i G dT{k — 1) has at least one offspring whose magnetic field is at 
least -Bmin) it follows by Griffiths inequality that m^'^{B_) is bounded below by the magnetization at the 
root of the subtree T of T(A;) where A, = 1 for all i G dT{k — 1) and Bi = i?min for all i G dT{k). Applying 
Lemma l4.ll for T and U = T{k — 1), the root magnetization for the Ising distribution on T turns out to 
be precisely m^~^{B_,£) for ^ = ^(/3, -Bmin) > of (j2.7p . Thus, one more application of Griffiths inequality 
yields that 

mJ^iB, 0) > m*^-i(5, > m''-\B, 0) . (4.8) 
Next note that B) < (3 < pA and by GHS inequality H ^ m''-^{B, H) is concave. Hence, 

m!'-^{B,l3A)-m^-^{B,{)) < M[m''-^{B,i) - m^-^{B,{))] , (4.9) 
for the finite constant 

M- /9A 

0</3</3max ?IP)-Dminj 

and all /3 < /3max- Combining (fi^ and (fOj) we obtain that 

Efc_i{m'='+(S) - m^'°(fi)} < m'^-H^, A/?) - m*^-i(5, 0) 

< M[m^-^{B,i)-m^'\B,Q)] < M[m'=(5, 0) - m'=-^(S, 0)] . 

We have seen in gS]) that /c m^^^{B) is non-decreasing whereas from ()4.6p and Griffiths inequality we 
have that A; m'^'^(S) is non- increasing. With magnetization bounded above by one, we thus get upon 
summing the preceding inequalities for = 1 , . . . , £ that 

mi-i[m^'+{B) - m^'°(B)] < ^Efc_i[m'='+(^) - m'='°(B)] < M, 

fe=i 
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from which we deduce (j4.5p . 

Considering now the general case where the infinite tree T has vertices (other than the root) of degree 
one, let T*{i) denote the 'backbone' of T(^), that is, the subtree induced by vertices along self-avoiding 
paths between and dT{£). Taking U = T*{i) as the subtree of T = T(^) in Lemma [4. 11 note that for each 
u £ 8^:11 the subtree T„ contains no vertex from dT{i). Consequently, the marginal measures jJ'ir^^'^ are 
Ising measures on U with the same magnetic fields B'- > Bi > -Bmin outside dT{£). Thus, with 'mi'~^^^{B_) 
denoting the corresponding magnetizations at the root for T*(^), we deduce that m^'^^^{B_) = mf'^^^(B') 
where B[ > Bi > .Bmin for all i. By definition every vertex of T*(£ — 1) has a non-zero offspring number 
and with B[ > -Bmin; the required bound 

E{m^'+(5) - m^^\B)} = E{m^'+(^') - m^f{B^)} < ^ 

follows by the preceding argument, since T*(^) is a conditionally independent tree whose offspring numbers 
A* > 1 do not exceed those of T{i). Indeed, for A: = 0, 1, — 1, given T*{k) the offspring numbers at 
i € dT* (k) are independent of each other (with probability of { = s} proportional to the sum over t >0 
of the product of the probability of {Aj = s + t} and that of precisely s out of the s + t offspring of i in 
T(£) having a line of descendants that survives additional I — k — 1 generations, for s > 1). □ 

Simon's inequality, see [22[ Theorem 2.1] allows one to bound the (centered) two point correlation 
functions in ferromagnetic Ising models with zero magnetic field. We provide next its generalization to 
arbitrary magnetic field, in the case of Ising models on trees. 

Lemma 4.3. // edge is on the unique path from to k £ T(£), with j a descendant of i £ dT{t), 

t > 0, then 

Xk)^^^ < cosh2(2/3 + Bi) {x^; Xi)f{xj; Xk)f , (4.10) 

where {■)[^^ denotes the expectation with respect to the Ising distribution fii{-) on the subtree Tj of i and 
all its descendants in T(r) and {x;y) = {xy) — {x){y) denotes the centered two point correlation function. 

Proof. It is not hard to check that if x, y, z are {+1, — l}-valued random variables with x and z conditionally 
independent given y, then 

In particular, under fj,^'^ the random variables X0 and x^ are conditionally independent given y = Xi with 

/fit 



log 



/•"(x, = +l) x 
fi'fi{xi = -l)) 



< 2{\di\P + B,). 



Hence, if j is the unique descendant of i then \{xi)^0^\ < tanh(2/3 + Bi) and we get from (j4.1ip that 

{x0',Xif)0 ^c{x0',Xi)0 {Xi',Xj^)0 

for c = cosh^(2/3 + i?j). Next note that {x; < 1 — (y)"^ for any two {+1, — l}-valued random variables, and 
since Xi and x^ are conditionally independent given y = xj it follows from (j4.1ip that {xf, Xk)^0^ < {xj; x^)'^^ ■ 
Further, if (•) is the expectation with respect to an Ising measure for some (finite) graph G then for any 
u,v € G 

n 7-> — {XyXu) {Xv){Xu} — {XvjXu) ■ {'^•^'^^ 
OBu 
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From Lemma 14.11 we know that computing the marginal of the Ising distribution for T = T{£) on a 
smaller subtree U = Tj of interest has the effect of increasing its magnetic field. Thus, combining the 
identity (|4.12p with GHS inequality, we see that reducing this field (i.e. restricting to U the original 
Ising distribution), increases the centered two point correlation function. That is, (xj;x/^)0'^ ^ (xj^x^)^- ^. 

Similarly, considering Lemma l4.ll for U = T{t) we also have that {x0]Xi)'^^ < {x^^Xii^^"^ which completes 
our thesis in case j is the unique descendant of i. 

Turning to the general case, we compare the thesis of the lemma for T(£) and the subtree U = T'{€) 
obtained upon deleting the subtrees rooted at descendants of i (and the corresponding edges to i) except 
for Jj. While {x0;xi)0^ and {xj;Xk)f^ are unchanged by this modification of the underlying tree (as the 

relevant subgraphs are not modified), we have from Lemma 14.11 that fJ.if{-) is an Ising measure on U 
identical to the original but for an increase in the magnetic field at i. In view of (j4.12p and the GHS 
inequality, we thus deduce that the value of {x^^Xk)^^'^ is smaller for the Ising model on T(^) than for the 
one on T'(£) and since in T'(£) the vertex j is the unique descendant of i, we are done. □ 

Equipped with the preceding lemma we next establish the exponential decay of correlations and of the 
effect of boundary conditions in Theorem 14. 2[ 



Corollary 4.4. There exist A finite and X positive, depending only on /3max; ^min; ^max; cind A such that 

e{ ^ {x0;x^)^^^]<Ae~^^ (4.13) 



for any r < £ and if Bi < i?max for all i € T{£ — 1) then Theorem \4.S\ holds for 6{t) = ^exp(— At). 

Remark. Taking Bi ] +oo for i £ dT{£), note that |^..?g| ) applies when { ■ )^^^ is with respect to /i^'^( • ). 

Proof. Starting with the proof of (|4.13p take £ = r for which the left side is maximal (as we have seen while 
proving Lemma l4.3p . Then, denoting by {■)Hr the expectation under the Ising measure on T(r) with a 
magnetic field Hr added to S at all vertices i G dT{r), it follows from (j4.12p that 



(r) _ d{x0) _ d{x0)Hr 



dBi dHr 

iedT{r) iedT{r) 



Hr=0 



By GHS inequality the latter derivative is non-increasing in Hr, whence 

(X0;xi)f^ < [{x0)Hr=O - {x0)Hr=~B^iJ2\■ 
Let B'- = Bi — i?min/2 if i E dT{r) and B'- = Bi otherwise, so {x0)Hr=~B^in/2 = m'^'^{B_'). Further, from 
Griffiths inequality also {x0)Hr=o < {x0)Hr=oo = m'"'+(B') and it follows that 

r,^E{ < -^E{m'-'+(^')-m'^'0(^')}. (4.14) 



iedT{r) 



In particular, setting c = cosh^(2/3jnax + ^max)i in view of (|4.5p we find that Td-i/ < l/(ecA) for d = 
1 + [2ecAAf(/3max5 ^min/2, A)/i?min] . Further, since T is conditionally independent, the same proof shows 
that t + d = r' < r and Tj is the subtree of T(r) of depth d — 1 rooted at j G dT{t + 1) then 
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Considering the inequality (j4.10p of Lemma 14.31 for t = r — d = ri and all k E 9T(r) we find that 



r. < ce{ (%;x,)i*)Et+i[ J] ]} 



jedT{t+l)ndi 



iedT(t) 

Iterating the preceding bound at = r — sd, for s = 1, . . . , lr/d\ and noting that by (j4.14p we have the 
bound Tr' < 2/i?min at the last step, we get the uniform in /3 < /3max exponential decay of (14.13p . 

Next, recall that the rate 5{t) in Theorem 14.21 is merely the rate in the bound (j4.5p . For k = \dT(£)\ we 
choose uniformly and independently of everything else a one to one mapping i : {1, . . . ,k} dT(i), and 
let S^^-* for s > 1 denote the magnetic field configuration obtained when taking -Bj(j) T +oo for all j < s 
(with =B). Since 

k—l 

we get the rate 5{t) = A exp(— At) from (j4.13p as soon as we show that for i = i(s + 1) and s = 0, . . . , /c — 1, 

To this end, let (•)s denote the expectation under n^'^ with magnetic field B^^^ so •n/'^{B^^^) = {x0)s. 
Further, fixing i = i{s + 1) 

^£,0/^(s+l)\ _ = _ {x0Xi)s + fa)^ 

'~ {I{Xi = l))s ~ l + {Xi)s 

(since I(xi = 1) = (1 + Xi)/2). Since (xi)^ > by Griffiths inequality, it follows that 

OBi 

which by GHS inequality is maximal at s = 0, yielding ()4.15p and completing the proof. □ 



As promised, Lemma 12.21 follows from the preceding results. 
Proof of Lemma 12.21 Consider the Galton- Watson tree T(p, oo) of Section 12.11 and the corresponding 
Ising models ;U*'~*"/'^(x) of constant magnetic field Bi = B > on the subtrees T(p, t). It is easy to check that 
the random variables h^^^ = atanh(m*'^(i?)) satisfy the distributional recursion 

([22]) starting at = 0. 

By Griffiths inequality m*'''(i?), hence is non-decreasing in t, and so converges almost surely as t — > oo 
to a limiting random variable h* . Further, the bounds = h^^^ < /i*-*-* < B + A0 hold for all t and hence 
also for h* . We thus deduce that the distributions Qt of /i^*) as determined by (|2.6p are stochastically 
monotone (in t) and converge weakly to some law Q* of h* that is supported on [0, 00). 

Next, recall that for any fixed k and F{-) continuous and bounded on R'^, the functional '^f{Q) = 
J F{hi, . . . , hk)dQ{hi) ■ ■ ■ dQ{hk) is continuous with respect to weak convergence of probability measures 
on [0,00) (for example, see j23i Lemma 7.3.12]). Fixing g( : IR 1— > [—C,C] continuous, clearly 

i-i 

gj{hi,...,hj)=g{B + Y,aP,hi)) 

i=l 
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are continuous and bounded. Further, it follows from (|2.6p that for all t 



J gdQt+i-Y,nK = j)^g,{Qt 



< CP{K > k) 



Taking t ^ oo followed by — > oo, we deduce by the preceding arguments (and the uniform boundedness 
< C for ah j), that 

/oo 
gdQ* = J2^{K = j)^g^{Q*). 
i=i 

As this applies for every bounded continuous function g{-), we conclude that h* and its law Q* are a fixed 
point of the distributional recursion (j2.6p . 

Next note that the random variables h^^ = atanh[m*''^(i?)] form a non-increasing sequence that satisfies 
the same distributional recursion, but with the initial condition /i^^ = +00. Consequently, by the same 
arguments we have used before, the laws Qt,+ of h^f^ converge weakly to some fixed point Q*, of (j2.6p 



that is also supported on [0, 00). Further, Qt ^ Q** ^ Qt,+ for t = and any (other) possible law Q** 
of a fixed point h** of p.6p that is supported on [0, 00). Coupling so as to have the same value of K, 
evidently the recursion (j2.6p preserves this stochastic order, which thus applies for all t. In the limit 
t ^ 00 we thus deduce that Q* ^ Q** ^ Q*^. Since p has finite first moment, by ()4.5|) of Theorem 14.21 

E|tanh(/i5^) - tanh(/iW)| — > as t — > 00. Thus, the expectation of the monotone increasing continuous 
and bounded function tanh(/i) is the same under both Q* and Necessarily this is also the expectation 
of tanh(/i) under Q** and the uniqueness of the non-negative fixed point of (j2.6p follows. □ 

We next control the dependence on /3 of the distribution of the fixed point h* from Lemma 12.21 

Lemma 4.5. Let \\X — y||MK denote the Monge-Kantorovich-Wasserstein distance between given laws of 
random variables X and Y (that is, the infimum o/E|X — Y\ over all couplings of X and Y). For any 
B > and /3max finite there exists a constant C = C{Praax, B) such that if h*p_^, h*^^ are the fixed points of 
the recursion 112. 6\) for < /3i,/32 < /?max; then 

||tanh(/i^J-tanh(^^J||MK <C|/32-/3i|. (4.16) 



Proof. Fixing a random tree T = T{p, 00) of degree distribution p, recall that while proving Lemma 
we provided a coupling of the random variables tanh(/i^) and the Ising root magnetizations m*'+/''(/3, S) 
at P such that 

m*'°(/3,5) < tanh(h^) < m*'+(/3,S) 

for each /3 and all t. By Griffiths inequality the magnetizations at the root are non-decreasing in /? so from 
the bound ()4.5p we get that for M = M(/?max, B,p) and any /3i < /32 < Pma.x, 

E|tanh(/i^J -tanh(/i^J| < Em*'°(/32, 5) - Em*'°(/3i, S) + ^ 



< {P2-P1) sup E — — } + — , 



dp ) t 



where the expectations are over the random tree T{p, 00). Considering t — > 00 it thus suffices to show that 
E[5m^'''/9/3] is bounded, uniformly in £ and /? < /3max- To this end, a straightforward calculation yields 

~Qfr^^'^^ = z2 i{x0^i^j) - {x0){XiXj)) , 
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with ( • ) denoting the expectation with respect to the Ising measure fi^'^. If i is on the path in T{£) between 
the root and j, then under the measure /d^'^ the variables X0 and xj are conditionally independent given 
Xi. Further, as Xi G {—1, 1} it is easy to check that in this case 



-1 and the conditional 



{x0XiXj^ i^X0)(x^iXj^ — ^{^X0^Xj^ , 

where 7 is the arithmetic mean of the conditional expected value of Xj for Xi 
expected value of Xj for Xi = 1. Thus, I7I < 1 and recalling (j4.12p that (3:0; xi) is non-negative by Griffiths 
inequality, we deduce that 



dp 



ieT(^-i) 



r=0 



where Aj denotes the offspring number at i € T and by (j4.12p 

iedT{r) ie9T(r) 



B=B 



(with m''{B_,ir) the root magnetization for the measure /i— '— of (j4.6p ). In view of Lemma l4.ll we have 
that m''{B,0) = m''-\B,H) for some non-negative vector H_. By GHS inequality we deduce that for any 
iGT(fe-l) 

dB,mHB,0) = dB,m^~\B,H) < dB,m''~HB,0) . 
Consequently, Vr^i is non-increasing in £ and 



E 



dm 



£-1 



i-i 



dl3 



r=0 



r=0 



r=0 



Further, m^{B_,0) is independent of the offspring numbers at dT{r) whose expectation with respect to the 
random tree T(p, 00) is p. Thus, applying ()4.13p of Corollary 14.41 for i = r, T = T{p, 00) and constant 
magnetic field, we find that for some A finite, A > 0, any r > and all (3 < /Jmax 



EVr 



pe[ dB,m'iB,0)\^^^]=pE[ ^ {x^; Xi)] <p Ae' 

iedT{r) iedT{r) 



Summing over r gives us the required uniform boundedness of E[9m^''^/9/3] in £ and 



□ 



5 Algorithms 

The theorems stated in Section 12.31 are in fact consequences of Corollary 14. 4[ 

Proof of Theorem 12.41 The proof is based on the well known representation of the iteration ()2.10p in 
terms of 'computation tree' [TOj. Namely, coincides with the marginal at the root of the Ising 

model (jl.ip on a properly constructed, deterministic tree T'^i^j{t) of t generations. While we refer to the 
literature for the precise definition of T'^i^j{t), here are some immediate properties: 

(a) One can construct an infinite tree T'^i^j{oo) such that, for any t, T'^i^j{t) is the subtree formed by 
the first t generations of T'^j^j(oo). 

(b) The maximal degree of T'^i^j(oo) is bounded by the maximal degree of G (and equal to the latter 
when G is connected). 
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(c) A positive initialization corresponds to adding Hi^j^ = atanh(i/|^^(+l) — u^^j^l—l)) non-negative to 
the field B on the t-th generation vertices of '^'^i-^j (t) ■ 

Denote by • ); ^i^ji ' ) the messages obtained under initializations = 1 and = 

(~^) ~ ^/^' respectively. By Griffiths inequality, is non-increasing in t, v^^j {+1) is non- 

decreasing in t and any positive initialization results with i^f2^j ( • ) such that 

-i5H+i)>-a,(+i)>-°:!?(+i). 

By Corollary [O] we have that u^Jfi+l) - (+1) < Ae"^* for all t > 0. Since yl < oo and A > 
depend only on /?, B and the maximal degree of G, this immediately yields our thesis. □ 

Proof of Theorem 12.51 We use an additional property of the computation tree: 

(d) If Bi{k) is a tree then J^i^j{k) is a tree rooted at z — > j whose vertices are the directed edges on the 
maximal subtree of Bj(A;) rooted at i that does not include j. 

Without loss of generality we may and shall assume that t > r. For U = Bj,(r) consider the local 
marginal approximations • ) defined as in (j2.12p except that the fixed point messages • ) 

at i G 9Bj^(r) are replaced by those obtained after {t — r) iterations starting at = 1 and 

u'^'}^^ (+1) = 1) = 1/2, respectively. Since Bi,{t) is a tree, here is necessarily the neighbor 

of i on the path from to i £ 5Bj^(r) and from the preceding property (d) we see that T'^j^j(j)(t — r) 
corresponds to the subtree of i and its lines of descendant in Bj^(i). By property (c) we thus have that 
z/J ( • ) and are the marginals on U of the Ising model z^"*" on G with = oo at all i ^ B (t) and the 

Ising model on the vertices of G and the edges within the tree Bj^(t). Such reasoning also shows that 
the probability measure uu of (j2.12p is the marginal on U of the Ising model v on vertices of G and edges 
of Bj^(f) with an additional non-negative magnetic field Hi^^ = 3'ta'nh(z^;*_^^(-|-l) — 1)) at 5Bj, (t). 

Consequently, with xp = HieF have by Griffiths inequality that for any F U 

and we deduce that for any F (^U, 

\{fJ-,XF) - {i^,xf)\ < {i^^jXf) - {f^^XF) < 2\\u^ - U^Wtv ■ 
Recall that since Xi G { — 1, 1}, for any possible value y = {yi,i £ U} of Xjj, 

Hxu = y) = 2-1^1 J](l + yiXi) = 2-l^l ^^ypXF, 
ieu FCU 

and with \yF\ < 1 it follows that 

\f^u{y) - i^u{y)\ = 2"'^'| ^ yF{{f^u,XF) - {i^u,xf))\ < max - {i^u,xf)\ < 2\\u^ - u^Wtv ■ 

FCU 

This applies for any of the 21^' possible values of x^, so 

• ) - • )IItv < 2l^l j|z.+ ( • ) - i^6r( • )IItv . 

Applying Corollarv 14.41 for the deterministic tree Bj, (t) rooted at i^,, we get the bound (j4.4p on the right 
side of the preceding inequality with 5{k) = Aexp(— A/c), some finite A and A > that depend only on /3, 
B and A. Thus, noting that \U\ = |B^(r)| < A^'+'^ + l we establish our thesis upon choosing c = c{A, G, A) 
large enough. □ 
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6 From trees to graphs 

We start with the fohowing technical lemma. 

Lemma 6.1. Consider a convex set /C C H and symmetric twice differentiable functions : IC^ ^ H with 
Fq constant, such that for some finite constant c, 



sup sup 



d'F, 



dxidx2 



< 2c. 



Suppose i.i.d. X,Xi G /C are such that £ ^'E\dxiFi{x, X2, ■ ■ ■ , X£)\ is bounded uniformly in I and x € /C 
and the independent, square-integrable, non-negative integer valued random variable L satisfies 



E[Ld,,FL{x,X2,...,XL)]=0, VxG/C. 
Then, for any i.i.d. Y,Yi ^ IC also independent of L, 

\E[Fl(Y,, ...,Yl)-Fl{Xi,...,Xl)]\< cE[L{L - 1)] \\X - Y\\i 



(6.1) 



(6.2) 



Proof. Our thesis trivially holds if either | |X — |mk — or ||x-y|| 

MK — 00, SO without loss of generality, 
fixing 7 > 1 we assume hereafter that {Xi, Yi) are i.i.d. pairs, independent on L and coupled in such a way 
thatE|Xi-yi| <7||x-y|| 

MK is finite. It is easy to check that almost surely, 



Fe{Yi, ...,Ye)- F,(Xi, . . . , X,) = J] A,F, + ^ f^^iY, - X,){Y, - X^) , 

i=l i^j 

where AjF^ = (Yi — Xi) dx^Fi^Xi, . . . ,tYi -\- {1 — t)Xi, . . . , X£)dt and each of the terms 
-1 rt Q2p 



(6.3) 



Jij 



Jo dxidxj 



{sYi + (1 - s)Xi, ...,tYi + {l- t)Xi, ...,sYi + {l- s)Xe)dsdt , 



is bounded by c. For i.i.d. {Xi,Yi), by the symmetry of the functions Fi with respect to their arguments, the 
assumed boundedness of £~^E\dx-^Fi{x, X2, . . . ,Xi)\ implies integrability of AjF^ with EAjF^ independent 
of i and £~^E|AiF£| uniformly bounded. This in turn implies the integrability of X^^Li Aji<L for any L 
square integrable and independent of {Xi,Yi), so by Fubini's theorem and our assumption ()6.ip . 



1=1 



E[^AiFi] =E[LAiFl] =e\{Yi-Xi) / E[Ldx,FLitYi + {I - t)Xi, X2, . . . , Xl)\ Xi,Yi]dt 



0. 



Thus, considering the expectation of (j6.3p . by the uniform boundedness of fj^j^ and the independence of L 
on the i.i.d. pairs {Xi,Yi), we deduce that 



E[Fl{Yi, . . . , Yi) - Fl(Xi, . . . , Xl)] \<cEY,\Yi- Xi\\Yj - Xj\ < j^cE[L{L - 1)] | |X - y | 



2 

MK 



Finally, taking 7 j 1 yields the bound (|6.2p . 



□ 
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Remark 6.2. It is not hard to adapt the proof of the lemma so as to replace Fi : IC >-^li by 0.5Fi{x, y) for 
a twice differentiable symmetric function Fi : K? ^ R. Taking Pi = P(L = i) the contribution of L = I 
to the left side of i6.1\) is then PiElOxj^Filx, X2)] and the bound \6.S\) is modified to 

^E[Fi(yi,y2) - Fi{Xi,X2)] + ^P,E[F,(n, ...,Yi)-Fi{Xi,...,X,)]\< c^L^] \\X - Y\\l^ . (6.4) 

Consider the functional h iph that, given a random variable h, evaluates the right hand side of 
Eq. ()2.9p . It is not hard to check that iph is well-defined and finite for every random variable h. The 
following corollary of Lemma |6. II plavs an important role in the proof of Theorem 12.31 



Corollary 6.3. There exist non- decreasing finite c{\(3\) such that if p < 00 and h* is a fixed point of the 
distributional identity ^2. 0\) for some /3, S G K, then 



|v;;,(AS)-(^;,*(/3,i3)| <c(|/3|)Pp||tanh(/i)-tanh(/i*)||^K- (6.5) 

Proof. Setting u = tanh(/3) so \u\ < 1, we verify the conditions of Lemma l6.ll when Xi are i.i.d. copies 
oi X = tanh(/i*) and Yi i.i.d. copies of y = tanh(/i), all of whom take values in /C = [—1,1] and are 
independent of the random variable L. We apply the lemma in this setting for the symmetric, twice 
differentiable functions 

Fi{xi,. . . ,xi) = ^ log(l + n3;iXj) + log|e-^JJ(l + nxi) + e"-^JJ(l-nxi)|, 



for £ >2, and as in Remark 16.21 

Fi(xi,X2) = -log(l + tiXiX2) +log{e^(l + nxi) +e"^(l - uxi)} + log {e^(l + nx2) + e~^(l - UX2)} . 
Indeed, setting i/j{x,y) = uy/{l + uxy) and for each £ > 1 



ge{x2, . . . , Xi) = tanh + atanh(tiXj) j , (6.6) 

i=2 

(so gi = tanh(i?)), it is not hard to verify that dxiFi{xi, X2) = ip{xi,gi) — ip{xi,X2) while for £ > 2 

1 ^ 

dxiF(,{xi,.. . ,Xi) = ilj{xi,ge{x2,. ■ ■ ,xe)) - -—-j'^7p{xi,Xj) . (6.7) 

i=2 

In particular, gi^ • ) are differentiable functions from /C^~^ to /C, such that dx2ge are uniformly bounded (by 
a = \u\/{l — u^)) and dyip{x,y) is uniformly bounded on (by b = \u\/{l — |u|)^). Consequently, dx^F^ 
and d'^Fijdx\dx2 are also uniformly bounded (by 2/(1 — |nj) and 6(a + l) = 2c(|/?|), respectively). Further, 

h* is a fixed point of (j2.6p . hence X\ = gKiX2, ■ ■ ■ ,Xk)- With Xi identically distributed and Ppk = kP^ 



we thus find as required in (j6.ip that 

PiE[dx,Fi{x,X2)] + Y.kPkE[dx,Fk{x,X2,...,Xk)] 



k>2 

00 



P{ ^pfcE[V^(x,5fe(X2,...,Xfc))]-E^(x,Xi)} =0. (6.8) 



k=l 
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Noting that E[L^] = P p our thesis is merely the bound ()6.4p upon confirming that 

for some constant Fq and that both series are absolutely summable. □ 

Let T(/?, oo) denote the infinite random tree obtained by 'gluing' two independent trees from the 
ensemble T(p, oo) through an extra edge e between their roots and considering e as the root of T(p, oo) 
denote by T{p, t) the subtree formed by its first t generations (i.e. consisting of e and the corresponding 
two independent copies from T{p,t)). An alternative way to sample from T(p, oo) is to have independent 
offspring number k — 1 with probability pk at each end of the root edge e and thereafter independently 
sample from this offspring distribution at each revealed new node of the tree. Equipped with these notations 
we have the following consequence of the local convergence of the graph sequence {Gn}- 

Lemma 6.4. Suppose a uniformly sparse graph sequence {Gn} converges locally to the random tree 
T(P, p, oo). Fixing a non-negative integer t, for each {i,j) € En denote the subgraph of Gn induced by 
vertices at distance at most t from by Bjj(t). Let F{-) be a fixed, bounded function on the collection 

of all possible subgraphs that may occur as Bij(t), such that F{Ti) = F{T2) whenever Ti ~ T2. Then, 

lim- F(B,,(t)) = |E{F(T(p,t))}. (6.9) 

Proof. Denoting by E(jj)(-) the expectation with respect to a uniformly chosen edge {i,j) in En, the left 
side of (|6.9p is merely (|£^„|/n)E(jj){F(Bij (t))}. A uniformly chosen edge can be sampled by first selecting 
a vertex i with probability proportional to its degree \di\ and then picking one of its neighbors j = j{i) 
uniformly. Thus, denoting by E„(-) the expectation with respect to a uniformly chosen random vertex 
i G [n], we have that 

F iFfR (t)V. JEn{|gi|F(B.,(.)(t))} 

Marking uniformly at random one offspring of in T(P,p,t + 1) (as corresponding to j{i)), let T*(t + 1) 
denote the subtree induced by vertices whose distance from either or its marked offspring is at most t. 
Since Bjj(j)(t) C Bj(t + 1) and with probability qt^k — > 1 as /c — > 00 the random tree T(P, p,t+l) belongs to 
the finite collection of trees with t + 1 generations and maximal degree at most k, it follows by dominated 
convergence and the local convergence of {Gn} that for any fixed I, 

lim En[\di\I{\di\<l)F{B,ja){t))] =Ep{A,I(A0 < /)F(T,(t + l))}, 

where Ep(-) and A0 denote expectations and the degree of the root, respectively, in T(P, p, 00). Similarly, 

lim E„{ \di\I{\di\ <l)} = EpA0l(A0 < /) . 

n— >oo 

Further, by the uniform sparsity of {Gn}, for any n and /, 



E4|9z|I(|5i| >/)F(B,,.(,)(t)) 



<e(l)\\F 



loo ) 
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with e{l) J, as Z — > oo. Since P has a finite first moment, A0 is integrable, so by the preceding, upon 
taking / ^ 00 we deduce by dominated convergence that 



E,{A,F(T,(t + l))} 
^t.., ^-^^ 



lini E(,,){F(B,,(t))} 



To complete the proof note that the right side of the last expression is precisely E{F(T(p, t))} and we have 
also shown that 2|^„|/n = E„{|9z|} EpA0 = P. □ 

Proof of Theorem [231 S ince = log Z^{^l3^ P) is invariant under 13 — > — 13 and is uniformly (in 

n) Lipschitz continuous in B with Lipschitz constant one, it suffices to fix i? > and show that (j)niP,B) 
converges as n ^ 00 to the predicted iph*{f3,B) of (j2.9p . whereby h* = h*^ is the unique fixed point of the 
recursion (j2.6p that is supported on [0, 00) (see Lemma [212]) . 

This is obviously true for /? = since (/>n(0, B) = log(2 cosh B) = (^/i(0, B). Next, denoting by ( • )„ the 
expectation with respect to the Ising measure on G„ (at parameters P and B), it is easy to see that 

dp<PniP,B) = - V {XiXj)n. (6.10) 

Clearly \dfs(pni(3, B)\ < \En\/n is bounded by the uniform sparsity of {Gn} so it is enough to show that the 
expression in (j6.10p converges to the partial derivative of (ph*^ (/?, B) with respect to /3. Turning to compute 
the latter derivative, by Lemma 14.51 and Corollary 16.31 we can ignore the dependence of /i^ on /?. That is, 
we simply compute the partial derivative in P of the expression (j2.9p while considering (the law of) hi to 
be fixed. Indeed, with notations u = tanh(/3) and Xj = tanh(/ij) as in the derivation of Corollary 16.31 ^ 
direct computation leads by the exchangeability of Xi to 

dp ^{(3, 5) = |u - |(1 - u^)nij{Xi,X2)] + (1 - u^)nLxl^{Xi,gL{X2, Xl))] , 

for Tp^Xju) = xy/{l + uxy) and gi{x2, ■ ■ ■ ,xi) of ()6.6p . Further, the fixed point property (16. 8p applies for 
any bounded measurable ip{-), so we deduce that 

E[Lij{XugLiX2, . . ■,Xl))] = PE[V(Xi,5x(X2, . . .,Xk))] = PE[V(Xi,X2)] . 

Consequently, it is not hard to verify that 

8,mB) = f E{^i±§^} = f e[(...,>^] , (6.11) 

where denotes the expectation with respect to the Ising model 

fij{xi,Xj) = — exp {l3xiXj + HiXi + Hjxj} , 

Zij 

on one edge (ij) and random magnetic fields Hi and Hj that are independent copies of h*^. 

In comparison, fixing a positive integer t, by Griffiths inequality the correlation {xiXj)n lies between 
the correlations Fo(Bjj(t)) = {xiXj)^ and F+(Bjj(t)) = {xiXj}'^ ^^^^ for the Ising model on the subgraph 

Bjj(t) with free and plus, respectively, boundary conditions at dBij{t). Thus, in view of ()6.10p 

i ^ Fo{Bij{t))<dpMP,B)<^ J2 F+{Bi,{t)), 
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and taking n — > oo we get by Lemma 16.41 that 

To compute i*o/+(T(p, i)) we first sum over the values of for k G T(/>, t) \ This has the effect 

of reducing Fo/_|_(T(/9, t)) to a form of {xiXj)^- Further, as shown in the proof of Lemma [2.21 we get 

FQ/_^_{T{p,t)) by setting for Hi and Hj two independent copies of the variables /i^*) and respectively, 
which converge in law to h*^ when t — > oo. We also saw there that the functional = 'E[{xiXj)j] (for 

continuous and bounded [/(-ffj, i/j) = (u + tanh(i?j) tanh(ifj))/(l + iitanh(i?j) tanh(i7j))), is continuous 
with respect to the weak convergence of the law i' of Hi. Consequently, by (j6.1ip 

hm ^nFo/+{T{p,m=dp^iP,B), 

t^oo z 

which completes the proof of the theorem. □ 
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